Spotting Overgeneration Suspects

نویسندگان

  • Claire Gardent
  • Eric Kow
چکیده

We present a method for quickly spotting overgeneration suspects (i.e., likely cause of overgeneration) in hand-coded grammars. The method is applied to a medium size Tree Adjoining Grammar (TAG) for French and is shown to help reduce the number of outputs by 70% almost all of it being overgeneration.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The typological consequences of weighted constraints

A common ‘typological criterion’ on linguistic models is that they should predict (almost) all observed patterns while minimizing overgeneration. For optimizationbased models, it has been argued that constraints should be ranked rather than weighted to minimize overgeneration. Recently, however, weighting has been shown to elegantly capture patterns that ranking misses. To evaluate the issue, w...

متن کامل

Overgeneration and Ranking for Spoken Dialogue Systems

We describe an implemented generator for a spoken dialogue system that follows the ‘overgeneration and ranking’ approach. We find that overgeneration based on bottom-up chart generation is wellsuited to a) model phenomena such as alignment and variation in dialogue, and b) address robustness issues in the face of imperfect generation input. We report evaluation results of a first user study inv...

متن کامل

Principle-Based Parsing without Overgeneration

Overgeneration is the main source of computational complexity in previous principle-based parsers. This paper presents a message passing algorithm for principle-based parsing that avoids the overgenera-tion problem. This algorithm has been implemented in C++ and successfully tested with example sentences from (van Riemsdijk and Williams, 1986).

متن کامل

Phonological Overgeneration in Paninian System

In this paper an attempt is made to study the problem of overgeneration that is caused by the application of the system of P ān. ini The system of P ān. ini is made up of certain rules stated by him and his commentators namely, Kātyāyana and Patañjali. These rules are supposed to produce the forms that are used in the language, i.e. Sanskrit. However, sometimes the technical application of thes...

متن کامل

In Proceedings of ACL - 93 PRINCIPLE - BASED PARSING WITHOUT OVERGENERATION 1

Overgeneration is the main source of computational complexity in previous principle-based parsers. This paper presents a message passing algorithm for principle-based parsing that avoids the overgenera-tion problem. This algorithm has been implemented in C++ and successfully tested with example sentences from (van Riemsdijk and Williams, 1986).

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007